Klasifikasi Pertanyaan Bidang Akademik Berdasarkan 5W1H menggunakan K-Nearest Neighbors

نویسندگان

چکیده

Pertanyaan merupakan metode terbaik dan termudah untuk menggali sebuah informasi. Menurut aturan 5W1H, terdapat enam bentuk dasar pertanyaan yang dapat digunakan memperoleh informasi, yaitu: what, where, when, why, who, how. Banyak jurnalis menggunakan ini, karena diimplementasikan dengan cepat mudah membangun pertanyaan. Untuk membuat sistem memahami pertanyaan, misalnya seperti pada chatbot, khusus harus diterapkan membedakan keenam jenis ada. Penelitian ini mencoba melakukan klasifikasi terhadap dokumen berdasarkan tokenisasi stemming tahap pra-pemrosesan, kemudian K-Nearest Neighbors (K-NN) mengklasifikasikan Berdasarkan hasil pengujian, nilai akurasi tertinggi adalah 70.27% k = 5.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Search K Nearest Neighbors on Air

While the K-Nearest-Neighbor (KNN) problem is well studied in the traditional wired, disk-based client-server environment, it has not been tackled in a wireless broadcast environment. In this paper, the problem of organizing location dependent data and answering KNN queries on air are investigated. The linear property of wireless broadcast media and power conserving requirement of mobile device...

متن کامل

k*-Nearest Neighbors: From Global to Local

The weighted k-nearest neighbors algorithm is one of the most fundamental nonparametric methods in pattern recognition and machine learning. The question of setting the optimal number of neighbors as well as the optimal weights has received much attention throughout the years, nevertheless this problem seems to have remained unsettled. In this paper we offer a simple approach to locally weighte...

متن کامل

Predicting Medical Conditions Using k-Nearest Neighbors

As the healthcare industry becomes more reliant upon electronic records, the amount of medical data available for analysis increases exponentially. While this information contains valuable statistics, the sheer volume makes it difficult to analyze without efficient algorithms. By using machine learning to classify medical data, diagnoses can become more efficient, accurate, and accessible for t...

متن کامل

Optical Character Recognition, Using K-Nearest Neighbors

The problem of optical character recognition, OCR, has been widely discussed in the literature. Having a hand-written text, the program aims at recognizing the text. Even though there are several approaches to this issue, it is still an open problem. In this paper we would like to propose an approach that uses K-nearest neighbors algorithm, and has the accuracy of more than 90%. The training an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: JEPIN (Jurnal Edukasi dan Penelitian Informatika)

سال: 2021

ISSN: ['2548-9364', '2460-0741']

DOI: https://doi.org/10.26418/jp.v7i1.45322